Open Domain Question Answering


FLEx: Language Modeling with Few-shot Language Explanations

Add code
Jan 07, 2026
Viaarxiv icon

From Chains to Graphs: Self-Structured Reasoning for General-Domain LLMs

Add code
Jan 07, 2026
Viaarxiv icon

AM$^3$Safety: Towards Data Efficient Alignment of Multi-modal Multi-turn Safety for MLLMs

Add code
Jan 08, 2026
Viaarxiv icon

VietMed-MCQ: A Consistency-Filtered Data Synthesis Framework for Vietnamese Traditional Medicine Evaluation

Add code
Jan 07, 2026
Viaarxiv icon

MARVEL: A Multi Agent-based Research Validator and Enabler using Large Language Models

Add code
Jan 06, 2026
Viaarxiv icon

pdfQA: Diverse, Challenging, and Realistic Question Answering over PDFs

Add code
Jan 06, 2026
Viaarxiv icon

FormationEval, an open multiple-choice benchmark for petroleum geoscience

Add code
Jan 05, 2026
Viaarxiv icon

Reasoning Over Recall: Evaluating the Efficacy of Generalist Architectures vs. Specialized Fine-Tunes in RAG-Based Mental Health Dialogue Systems

Add code
Jan 04, 2026
Viaarxiv icon

AdaGReS:Adaptive Greedy Context Selection via Redundancy-Aware Scoring for Token-Budgeted RAG

Add code
Dec 31, 2025
Viaarxiv icon

Retrieval Augmented Question Answering: When Should LLMs Admit Ignorance?

Add code
Dec 29, 2025
Viaarxiv icon